Signal processing apparatus based upon selective conversion of audio signal to text signal

ABSTRACT

Signal processing apparatus for providing a conversion of audio and E-mail signals received from a network interface  13  and for supplying the signals either to an audio reception device  80  in the form of a telephone handset or to a video reception in the form of a video processor  8  for display on a television set  9  and, selectively, for converting the received signals for reception on the other reception device.

This application is a continuation of U.S. patent application Ser. No.08/819,096, filed Mar. 18, 1997 now U.S. Pat. No. 6,138,090, which areexpressly incorporated by reference herein in its entirety.

BACKGROUND AND FIELD OF THE INVENTION

The present invention relates to signal processing apparatus, moreparticularly but not exclusively to apparatus for providing access totelephone and internet systems.

Telephony apparatus and apparatus for providing access to the internetand more particularly E-mail services are well-known, telephony servicesbeing able to provide an interchange of information orally/aurallywhereas E-mail services provide a means for interchanging informationvisibly.

It is an inherent disadvantage of telephony and E-mail systems that theylack flexibility in terms of the medium of expression and accordingly itis an object of the invention to provide apparatus which may be used togive greater flexibility in the use of such communication systems.

SUMMARY OF THE INVENTION

According to the invention in a first aspect, there is provided aprocessing apparatus comprising means of processing data received from anetwork link; means for conversion of data from text to speech formatand vice versa; means for processing the data or the converted data foraural reception; means for processing the data or the converted data forvisual reception; control means for selecting a reception mode for thedata and instructing conversion of the data if the data is in anincorrect format.

Preferably the processing apparatus further includes apparatus for auralreception and transmission of the data which may be portable and maycomprise a keypad for generating telephone and appliance controlsignals, a speaker for aural output and a microphone for oral input.Most preferably, the oral reception and transmission apparatus is in theform of a telephone handset using a cordless radio frequency link.

Preferably the network link of the processing apparatus is a link to atelephone network.

According to the invention in a second aspect, there is provided signalprocessing apparatus comprising means for processing data received froma network link; means for conversion of data from a text format to aspeech format and vice versa; means for connection to an audiotransceiver; means for connection to a video receiver; means forconnection to a text input device; control means for selectivelyinstructing conversion of the signal by the data conversion means andinstructing output of the signal or the converted signal to therespective audio or video receiver connection means.

Preferably the signal processing apparatus further comprises means forconnection to a text input device and the data processing means isfurther for processing a data transmission signal to be transmitted overthe network link and the control means is further for receiving an audioinput signal from the audio transceiver connection means or a text inputsignal from the text input device connection means, selectivelyconverting the input signal to a text or audio signal and transmittingthe input signal or the converted input signal to the data processingmeans for transmission over the network link to a pre-determineddestination.

The signal processing apparatus may further comprise an audiotransceiver which may be in the form of a telephone handset.

The data processing means may comprise internet connection means fortransmitting, receiving and processing E-mail messages and the videoreceiver connection means may comprise a video processor connected tothe data processing means for processing the E-mail messages for videodisplay.

The data conversion means may comprise a text to speech processor and aspeech to text processor and may include a modem for connection to thetelephone network.

The signal processing apparatus preferably further comprises digitalmessage storing means for storing an aural message and the control meansmay further be for instructing output of the message as an audio signalto the data processing means for transmission over the network link. Thestored message may be a pre-recorded message or the control means mayfurther be for instructing selective conversion and storage of the datasignal as the message.

The signal processing apparatus may further be for instructing selectiveconversion and storage of the data signal as a text signal andsubsequently instructing output of the text signal to the dataprocessing means for transmission over the network link to apredetermined destination.

According to the invention in a third aspect, there is provided signalprocessing apparatus comprising: signal processing means comprisingmeans for connection to an audio transceiver; means for connection to atext input device; means to conversion of an audio signal to a textsignal and vice versa; data processing means for processing a datatransmission signal to be transmitted over the network link and; controlmeans for receiving an audio input signal from the audio transceiverconnection means or a text input signal from the text input deviceconnection means, selectively converting the input signal to a text oraudio signal and transmitting the input signal or the converted inputsignal to the data processing means for transmission over the networklink to a predetermined destination.

BRIEF DESCRIPTION OF THE DRAWINGS

An embodiment of the invention will now be described, by way of example,with reference to the accompanying drawings in which:

FIG. 1 is a block diagram of an embodiment of signal processingapparatus of the invention showing transfer of data and control signalsbetween a network, a console unit, a portable unit, a television, akeyboard and a mouse.

FIG. 2 is a block diagram showing the various components of the consoleunit.

FIG. 3 is a block diagram showing the various components of the portableunit.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

In FIG. 1, a console unit 40 communicates with a portable unit 80. Theconsole unit 40 has a protective housing (not shown) but the portableunit 80 is preferably in the external form of a multi-purpose telephonehandset, for example as shown in U.S. Pat. No. 5,138,649.

The console unit 40, as depicted in FIG. 2, is powered by regulatedpower supply 14 and facilitates transfer of data and control signals toand from a generic network link 32. Data and control signals from thegeneric network link 32 are processed by an appropriate networkinterface box 13 before being received by the console unit 40 forprocessing. In a first preferred form, network link 32 is a link to aconventional telephone network and accordingly network interface box 13represents a modem. The data received from network link 32 may containinformation pertaining to E-mail messages or conventional telephonesignals.

For E-mail transfer, in response to a user request or after apredetermined time interval, the microcontroller 2 instructs dial-out toa remote server to perform the necessary steps to retrieve E-mailmessages from the server in the conventional manner. The receivedmessages are processed by serial interface module 12 in a conventionalmanner and sent to internet protocol controller 1 to be converted totext format. The E-mail data received by internet protocol controller 1on conversion to text can, on instruction by microcontroller 2, be sentto text-to-speech processing module 6. Text-to-speech processing module6 is a digital signal processor programmed with known techniques andconverts the text containing the incoming E-mail message to analog ordigitized speech. The incoming E-mail can, therefore, be in text formatat the internet protocol controller 1 and/or analog/digitized speechformat at the text-to-speech processing module 6.

Microcontroller 2 in console unit 40 is supported by memory storage 11.Memory storage 11 provides random access memory (RAM) for use bymicrocontroller 2 in a conventional manner. Memory storage 11 alsoprovides firmware, through read-only memory (ROM), for operatingmicrocontroller 2 and for processing data and signals received byconsole unit 40. Programs resident in memory storage 11 together withinstructions from microcontroller 2 determine the format of the data,that is text or analog/digitized speech. and various alternatives forinforming a user of the system, of incoming E-mail messages.

In a first alternative, the user can be informed of incoming E-mailmessages by programming microcontroller 2 to instruct telephony functionmodule 3, Which establishes communication links with the telephonenetwork via the modem, to dial-out a programmed telephone number atwhich the user may be contacted. Microcontroller 2 then instructsdigital answering machine 15, which is capable of storing speech, tosend a programmed speech message, via telephony function module 3 andnetwork link 32, alerting the user of incoming E-mail messages.Alternatively microcontroller 2 can instruct text-to-speech processingmodule 6 to convey the incoming E-mail message in analog speech formatto the user via telephony function module 3 and network link 32.

In a second alternative, a user may program the microcontroller 2 tosend incoming E-mail messages on conversion to text, to video processor8 for further conversion to appropriate video signals for display on aconventional television set or video monitor 9. The user may then readthe messages on the television set or video monitor 9 when convenient.The user may also instruct microcontroller 2 to store incoming E-mailmessages in memory storage 11 in text format for later retrieval by theuser on television set or video monitor 9.

In a third alternative, the user may be informed of incoming E-mailmessages by programming microcontroller 2 to forward the messages tocommunication module 17. Communication module 17 modulates the incomingE-mail messages, in analog/digitized speech format, for transmission bytransceiver unit 4 to portable unit 80.

FIG. 3 depicts portable unit 80 which is powered by a battery 24 viabattery management module 19. Transceiver unit 28 in the portable unit80 receives the modulated analog/digitized speech containing theincoming E-mail messages and further processing is determined bymicrocontroller 27.

Microcontroller 27 is supported by memory 18 in a manner similar tomicrocontroller 2 and memory storage 11 in console unit 40.Microcontroller 27 can be programmed to instruct communication module 29to demodulate the analog/digitized speech containing incoming E-mailmessages. Communication module 29 then ensures that the speechcontaining the E-mail messages is in analog format before conveying itto the user through speaker 22.

Alternatively microcontroller 27 is programmed by the user to activatemechanical vibrator 25 and/or buzzer 26 to notify the user of incomingE-mail messages.

The user may also dictate an E-mail message using portable unit 80 or aconventional telephone/mobile phone. If portable unit 80 is used fordictation of an E-mail message, the outgoing E-mail message in speechformat is sent from microphone 23 to communication module 29 where it ismodulated and transmitted to console unit 40 by transceiver unit 28.Transceiver unit 4 receives the modulated outgoing E-mail message andsends it to communication module 17 where it is demodulated and sent todigital answering machine 15 to be digitized and compressed. Thecompressed digitized speech containing the outgoing E-mail message isthe n sent to speech-to-text processing module 5 to be converted totext. Speech-to-text processing module is a digital signal processorprogrammed with known software, such as Dragon Dictate® for conversionof compressed digitized speech to text. On conversion to text, theoutgoing E-mail message is sent out to the remote server via internetprotocol controller 1, serial interface module 12 and the network link32.

If a conventional telephone/mobile phone is used for dictation of anE-mail message by a remote user, the outgoing E-mail message, receivedas a telephone message in speech format, is received by telephonyfunction module 3 in console unit 40 through network link 32 and sent todigital answering machine 15 to be compressed and digitized. Thecompressed digitized speech containing the outgoing E-mail message isthen converted to text by speech-to-text processing module 5 and sentout to the remote user via internet protocol controller 1, serialinterface module 12 and network link 32, as before.

The actuation of keys on the keypad of a conventional telephone/mobilephone or keys on keypad 21 of portable unit 80, in a predeterminedmanner identifiable by microcontroller 2, sends control signals toconsole unit 40 to inform microcontroller 2 if the speech being sent isconventional telephone speech or speech containing an outgoing E-mailmessage to be suitably processed. In the case of an E-mail message, suchkeys also can send recipient identification information, such as anabbreviated dial code that the handset or the microcontroller 2 can lookup in memory to determine the E-mail address or a handset which allowsthe alpha-numeric characters to be input from the keypad throughalternative key functions may be used. A possible format for such amessage would be:

# # Dictated E-mail message # *Direct entry or memory recall of E-mailaddress*# Alternatively, dedicated function keys on keypad 21 inportable unit 80 may be actuated specifically to send control signals toconsole unit 40, to inform microcontroller 2 if the speech signals beingsent is conventional telephone speech or speech containing an outgoingE-mail message to be suitably processed.

When the data received through network link 32 contains signalspertaining to conventional telephony, such as audio and control signals,telephony function module 3 establishes a communication link Withnetwork link 32 via network interface box 13 and informs microcontroller2. Programs resident in memory storage 11 and instructions frommicrocontroller 2 then determine how the telephone signals areprocessed.

In a first alternative, a system user can be informed of incomingtelephone calls by programming microcontroller 2 to instruct telephonyfunction module 3, after receipt of an incoming call, to dial-out aprogrammed telephone number at which the user may be contacted. Digitalanswering machine 15 then sends a programmed speech message, viatelephony function module 3 and network link 32, notifying the user ofnew telephone calls.

In a second alternative, the telephone signals may be relayed to theportable unit 80 held by the user, in the usual way. This is achieved byprogramming microcontroller 2 to instruct communication module 17 tomodulate the telephone signals and send them to transceiver unit 4. Thetelephone signals are then transmitted by transceiver unit 4 to portableunit 80 where they are received by transceiver unit 28, sent tomicrocontroller 27 and demodulated by communication module 29. The usermay then take the telephone call listening through speaker 22 andspeaking through microphone 23, the speech signals being modulated bycommunication module 29 and transmitted to console unit 40 bytransceiver unit 28.

Transceiver unit 4 in console unit 40 receives the telephone signals andsends them to communication module 17 for demodulation. The telephonesignals are then sent out to network link 32 via telephony functionmodule 3. Telephone control signals from keypad 21 may be sent in asimilar way to initiate a call and send control instructions.

In a third alternative, a user may program microcontroller 2 to instructdigital answering machine 15 to digitize and compress the incomingtelephone message and sent this to microcontroller 2. Microcontroller 2then forwards the compressed digitized speech signals to speech-to-textprocessing module 5 where they are converted to text before being storedin memory storage 11. The telephone message in text format is thenretrieved from memory storage 11, converted to appropriate video signalsby video processor 8 and may be read by the user on television set orvideo monitor 9.

Portable unit 80 can also be used as a remote control for television set9. The keys on keypad 21 in portable unit 80 can be actuated to generatetelevision control signals which are then sent by microcontroller 27 toconsole unit 40 upon modulation. Transceiver unit 4 in console unit 40then receives the television control signals which are then demodulatedby communication module 17 and forwarded to video processor 8 bymicrocontroller 2. The television set 9 is then controlled by thetelevision control signals sent by video processor 8 which receivestelevision signals from television antenna 34. Dedicated function keyson keypad 21 or a predetermined sequence of undedicated keys may beactuated to send control signals to console unit 40 to informmicrocontroller 2 of arriving television signals to be suitablyprocessed.

Console unit 40 and portable unit 80 are preferably equipped with liquidcrystal display (LCD) screens/light emitting diodes (LEDs) 10 and 20respectively which can be used to display information such as date, timeand number of E-mail and telephone messages. Console unit 40 preferablyalso provides for a keypad and mouse interface module 16 whichinterfaces with keyboard and mouse unit 7. The keyboard and mouse may beused to instruct microcontroller 2 to set defaults for treatment ofincoming and outgoing telephone and e-mail messages, as well as sendinge-mail messages in a conventional manner.

The embodiment of the invention described is not to be construed aslimitative. For example, although shown using a radio frequencycommunication link between console unit 40 and portable unit 80, theinvention is equally of use with an infra-red or hard-wiredcommunication link. Furthermore, network link 32 may be a Local AreaNetwork (LANl) link and accordingly network interface box may be asuitable card such as an Ethernet® card and internet protocol controllermay be replaced by an appropriate LAN protocol controller. As such it isto be understood that various modifications could be made to theembodiment described without departing from the scope of the inventionand that the invention is limited only by the scope of the appendedclaims.

What is claimed is:
 1. A signal processing apparatus for sending amessage from a sender to a recipient over a communication network, saidapparatus comprising: an audio transceiver; and a signal processorarranged to receive an audio input signal from the audio transceiver, toselectively convert the audio input signal to a text signal undercontrol of the sender and to transmit one of the audio input signal andthe converted input signal over the network to a predetermineddestination.
 2. The apparatus as claimed in claim 1, wherein the textsignal forms the content of an E-Mail message.
 3. The apparatus asclaimed in claim 2, wherein the E-Mail message further includes adestination address.
 4. The apparatus as claimed in claim 3, wherein thedestination address is received together with the audio input signal andthe signal processor includes a system for extracting the destinationaddress.
 5. The apparatus as claimed in claim 4, wherein the destinationaddress received with the audio input signal is in a form of a codecorresponding to the destination address and the apparatus furthercomprises a look-up table for storing the destination addresscorresponding to the code.
 6. The apparatus as claimed in claim 3,wherein the audio input signal and the destination address are separatedusing predetermined indicators.
 7. The apparatus as claimed in claim 5,wherein the indicators comprise key strokes of a telephone handset. 8.The apparatus as claimed in claim 6, wherein the key strokes comprisecombinations of # and *.
 9. The apparatus as claimed in claim 1, whereinthe audio transceiver is in a form of a telephone handset.
 10. Theapparatus as claimed in claim 1, further comprising a system forconnection to a text input device.
 11. The apparatus as claimed in claim10, further comprising a text input device connectable to the text inputdevice connection system.
 12. The apparatus as claimed in claim 11,further comprising a system that selectively converts a text signal toan audio signal under control of the sender and transmit one of the textsignal and the converted text signal over the network to a predetermineddestination.
 13. The apparatus as claimed in claim 1, further comprisinga video processor connected to the signal processor, for processing textmessages for video display.
 14. A signal processing apparatus forsending a message from a user to a recipient over a telephone networkcomprising: means for receipt of a speech and data signal over atelephonic link from the user at a location remote from the apparatus,the signal including a spoken message and an E-mail destination address;means for selective conversion, under control of the user, of the spokenmessage to a text signal; and means for transmission of the text signalover the network to the E-mail destination address.
 15. The apparatus asclaimed in claim 14, wherein the spoken message and the E-maildestination address of the message are separated using predeterminedindicators.
 16. The apparatus as claimed in claim 15, wherein theindicators comprise key strokes of a telephone handset.
 17. Theapparatus as claimed in claim 16, wherein the key strokes comprisecombinations of # and *.
 18. The apparatus as claimed in claim 14,wherein the E-mail destination address is in a form of a codecorresponding to the E-mail destination address and the apparatusfurther comprises a look-up table for storing the E-mail destinationaddress corresponding to the code.
 19. A signal processing apparatus forsending a message from a user of the apparatus to a recipient over acommunications network comprising: a system that connects the apparatusto a user end of the communications network; an audio transceiver; atext input device; a signal processing system arranged to convert anaudio signal to a text signal and vice versa, and arranged to receive anaudio input signal from the audio transceiver or a text input signalfrom the text input device, to instruct selective conversion of theinput signal to a text or audio signal and to transmit the input signalor the converted input signal over the communication network to apredetermined destination.
 20. The apparatus as claimed in claim 19,wherein the communication network is a telephone network and theconnection system connects to a subscriber end of the telephone network.21. The apparatus as claimed in claim 19, wherein the network is a localarea network.